niklas-pihl: Implemented optimized matrix multiplication #11

Open
pihlnikl wants to merge 3 commits into parallelcomputingabo:main from pihlnikl:niklas-pihl

pihlnikl commented May 6, 2025

  • Implemented the optimizations based on the provided pseudocode.
  • Blocked multiplication did not seem to be any faster with any of the recommended block sizes, but it otherwise seems to work correctly.
  • Block size: 128
  • Thread number: 8
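
For readers unfamiliar with the blocked variant mentioned above, here is a minimal sketch of what it might look like. It is not the code from this PR. The function name `blocked_matmul`, the row-major `std::vector<double>` layout, and the loop order are assumptions made for illustration; only the default block size of 128 comes from the description. The sketch is single-threaded, so the thread count of 8 is not reflected here.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not taken from the PR: multiplies row-major
// A (m x n) by B (n x p) and returns C (m x p), working tile by tile.
std::vector<double> blocked_matmul(const std::vector<double>& A,
                                   const std::vector<double>& B,
                                   std::size_t m, std::size_t n, std::size_t p,
                                   std::size_t block = 128) {
    assert(A.size() == m * n && B.size() == n * p && block > 0);
    std::vector<double> C(m * p, 0.0);

    // Outer three loops walk over tiles of C, A, and B.
    for (std::size_t ii = 0; ii < m; ii += block) {
        for (std::size_t kk = 0; kk < n; kk += block) {
            for (std::size_t jj = 0; jj < p; jj += block) {
                // Clamp tile bounds so sizes need not be multiples of block.
                const std::size_t i_end = std::min(ii + block, m);
                const std::size_t k_end = std::min(kk + block, n);
                const std::size_t j_end = std::min(jj + block, p);

                // Inner three loops multiply one tile of A by one tile of B.
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const double a = A[i * n + k];
                        for (std::size_t j = jj; j < j_end; ++j) {
                            C[i * p + j] += a * B[k * p + j];
                        }
                    }
                }
            }
        }
    }
    return C;
}
```

If the elements are 8-byte doubles (again an assumption), one 128 x 128 tile takes 128 KiB, so the tiles of A, B, and C together need 384 KiB. Whether that fits in cache on the test machine could be one factor in the timings, but this has not been measured here.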

